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00 ' Very recently, we proposed the row-monomial distributed orthogonal space-time block codes (DOSTBCs) and 

' showed that the row-monomial DOSTBCs achieved approximately twice higher bandwidth efficiency than the repetition- 

■ based cooperative strategy [1]. However, we imposed two limitations on the row-monomial DOSTBCs. The first one 

was that the associated matrices of the codes must be row-monomial. The other was the assumption that the relays 
, did not have any channel state information (CSI) of the channels from the source to the relays, although this CSI 

^ could be readily obtained at the relays without any additional pilot signals or any feedback overhead. In this paper, 

we first remove the row-monomial limitation; but keep the CSI limitation. In this case, we derive an upper bound of 
the data-rate of the DOSTBC and it is larger than that of the row-monomial DOSTBCs in [1]. Secondly, we abandon 
the CSI limitation; but keep the row-monomial limitation. Specifically, we propose the row-monomial DOSTBCs with 
channel phase information (DOSTBCs-CPI) and derive an upper bound of the data-rate of those codes. The row- 
monomial DOSTBCs-CPI have higher data-rate than the DOSTBCs and the row-monomial DOSTBCs. Furthermore, 
we find the actual row-monomial DOSTBCs-CPI which achieve the upper bound of the data-rate. 

Index Terms — Cooperative networks, distributed space-time block codes, diversity, single-symbol maximum like- 
lihood decoding. 
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I. Introduction 

In a cooperative network, the relays cooperate to help the source transmit the information-bearing symbols 
to the destination. The relay cooperation improves the performance of the network considerably [2]-[5]. The 
cooperative strategy of the relays is crucial and it decides the performance of a cooperative network. A simple 
cooperative strategy is the repetition-based cooperative strategy which was proposed in [4] and studied in [7]- 
[11]. This cooperative strategy achieves the full diversity order in the number K of relaysQ Furthermore, the 
maximum likelihood (ML) decoding at the destination is single-symbol ML decodable]^ However, the repetition- 
based cooperative strategy has poor bandwidth efficiency, since its data-rate]^ is just 1/K. Many works have been 
devoted to improve the bandwidth efficiency of the cooperative networks, such as the cooperative beamforming 
[12], [13], and the relay selection [14]-[16]. More attentions have been given to the distributed space-time codes 
(DSTCs) [17]-[19]. Furthermore, many practical DSTCs have been proposed in [20]-[26]. Although all those codes 
could improve the bandwidth efficiency, they were not single-symbol ML decodable in general, and hence, they 
had much higher decoding complexities than the repetition-based cooperative strategy. 

Very few works have tried to propose the DSTCs achieving the single-symbol ML decodability and the full 
diversity order. In [27], Hua et al. used the existing orthogonal designs in cooperative networks; but they found 
that most codes were not single-symbol ML decodable any more. In [28], Rajan et al. used the Clifford unitary 
weight single-symbol decodable codes in cooperative networks. The codes were single-symbol ML decodable only 
when there were four relays. Moreover, the codes could not achieve the full diversity order in an arbitrary signal 
constellation. In [29], Jing et al. applied the orthogonal and quasi-orthogonal designs in cooperative networks and 
they analyzed the diversity order of the codes. The authors of [29] claimed that the codes achieved the single-symbol 
ML decodability as long as the noises at the destination were uncorrected. However, we noticed that the rate-3/4 
code given in [29] was actually not single-symbol ML decodable, although it generated uncorrected noises at the 
destination. Actually in this paper, we will show that, when the noises are uncorrected, the codes have to satisfy 
another constraint in order to be single-symbol ML decodable. 

Only very recently, the DSTCs achieving the single-symbol ML decodability have been studied. In [1], we 
proposed the distributed orthogonal space-time block codes (DOSTBCs), and we showed that the DOSTBCs 
achieved the single-symbol ML decodability and the full diversity order. Moreover, we systematically studied 
some special DOSTBCs, namely the row-monomial DOSTBCs, which generated uncorrected noises at the des- 
tination. Specifically, an upper bound of the data-rate of the row-monomial DOSTBC was derived. This upper 
bound suggested that the row-monomial DOSTBCs had approximately twice higher bandwidth efficiency than the 

'in this paper, unless otherwise indicated, saying one code or one scheme achieves the full diversity order means it achieves the full diversity 
in an arbitrary signal constellation. 

code or a scheme is single-symbol ML decodable, if its ML decoding metric can be written as a sum of multiple terms, each of which 
depends on at most one transmitted information-bearing symbol [6]. 

^In this paper, the data-rate of a cooperative strategy or a distributed space-time code is equal to the ratio of the number of transmitted 
information-bearing symbols to the number of time slots used by the relays to transmit all these symbols. 
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repetition-based cooperative strategy. In [1], however, we imposed two limitations on the row-monomial DOSTBCs, 
in order to simplify the analysis. The first one was that the associated matrices of the codes must be row-monomiafl 
which ensured uncorrected noises at the destination. The other was the assumption that the relays did not have 
any channel state information (CSI) of the channels from the source to the relays, i.e. the channels of the first 
hop. Actually, since we assumed the destination had the CSI of the channels from the source to the relays and the 
channels from the relays to the destination in [1], the CSI of the first hop could be easily obtained at the relays 
without requiring additional pilot signals or any feedback overhead. But, it is still unknown what impact those two 
Umitations have on the data-rate of the codes. This has motivated our work. 

In this paper, we first abandon the row-monomial limitation; but keep the CSI limitation. That is, we consider 
the DOSTBCs where the noises at the destination are possibly correlated and the relays do not have any CSI of 
the first hop. We derive an upper bound of the data-rate of the DOSTBC and it is larger than that of the row- 
monomial DOSTBC in [1]. This implies that the DOSTBCs can potentially improve the bandwidth efficiency of the 
cooperative network. But, like the row-monomial DOSTBCs, the DOSTBCs may not have good bandwidth efficiency 
in a cooperative network with many relays, because the upper bound of the data-rate of the DOSTBC decreases 
with the number K of relays. Secondly, we remove the CSI limitation; but keep the row-monomial limitation. 
Specifically, the relays know the channel phase information (CPI) of the first hop and use this information in the 
code construction. Those codes are referred to as the row-monomial DOSTBCs with CPI (DOSTBCs-CPI). We 
derive an upper bound of the data-rate of the row-monomial DOSTBC-CPI and also find the actual codes achieving 
this upper bound. The upper bound of the data-rate of the row-monomial DOSTBC-CPI is higher than those of 
the DOSTBCs and the row-monomial DOSTBCs. Thus, the row-monomial DOSTBCs-CPI have better bandwidth 
efficiency than the DOSTBCs and the row-monomial DOSTBCs. Furthermore, the upper bound of the data-rate of 
the row-monomial DOSTBC-CPI is independent of the number K of relays, which ensures the codes have good 
bandwidth efficiency even when there are many relays. 

The rest of this paper is organized as follows. Section |ll] describes the cooperative network considered in this 
paper. In Section Hill we remove the row-monomial limitation; but the relays do have any CSI. Specifically, we 
study the DOSTBCs and derive an upper bound of the data-rate of the DOSTBC. In Section |IV] the relays exploit 
the CPI to construct the codes; but the row-monomial limitation is maintained. Specifically, we first define the row- 
monomial DOSTBCs-CPI and then derive an upper bound of the data-rate of the row-monomial DOSTBC-CPI. 
We present some numerical results in Section [V] and conclude this paper in Section |VH 

Notations: Bold upper and lower letters denote matrices and row vectors, respectively. Also, diag[xi, • • • tXk] 
denotes the K x K diagonal matrix with a;i, • • • , xk on its main diagonal; the all-zero matrix; / the identity 
matrix; det(-) the determinant of a matrix; [■]k the fc-th entry of a vector; [■]ki,k2 the (fci, fc2)-th entry of a matrix; 
(•)* the complex conjugate; (•)^ the Hermitian; (•)^ the transpose. Let X = [xi, • • • ; xk] denote the matrix with 
Xk as its fc-th row, \ < k < K . For a real number a, [a] denotes the ceiling function of a. 

^\ matrix is said to be row-monomial (column-monomial) if there is at most one non-zero entry on every row (column) of it [30]. 
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II. System Model 



Consider a cooperative network with one source, K relays, and one destination. Every terminal has only one 
antenna and is half-duplex. Denote the channel from the source to the fc-th relay by hk and the channel from the 
fc-th relay to the destination by fk, where hk and are spatially uncorrected complex Gaussian random variables 
with zero mean and unit variance. We assume that the destination has full CSI, i.e. it knows the instantaneous 
values of hk and fk by using pilot signals; while the source has no CSI. The relays may have partial CSI and this 
will be discussed in detail later. 

At the beginning, the source transmits N complex-valued information-bearing symbols over N consecutive time 
slotslfl Let s = [si, • • • , Sat] denote the information-bearing symbol vector transmitted from the source, where the 
power of s„ is Es- Assume the coherence time of hk is larger than N\ then the received signal vector y^. at 
the fc-th relay is y^. = hkS + rife, where Uk — [nk,i, ■ ■ ■ ,nk,N] is the additive noise at the fc-th relay and it is 
uncorrected complex Gaussian with zero mean and identity covariance matrix. All the relays are working in the 
amplify-and-forward mode and the amplifying coefficient p is y/ Er/{1 + Es) for every relay, where Er is the 
transmission power at every relayQ Based on the received signal vector y^., the fc-th relay produces a transmitted 
signal vector and forwards it to the destination. 

Firstly, we present the system model of the DOSTBCs, which will be studied in Section |III1 We assume that 
the fc-th relay has no CSI of the first hop. This can be true when the relays do not have any channel estimation 
devices due to strict power and/or size constraints]^ Then the fc-th relay produces the transmitted signal vector 
as follows: 



The matrices Ak and Bk are called the associated matrices. They have the dimension of N xT and their properties 
will be discussed in detail later Assume the coherence time of fk is larger than T. The received signal vector at 

'if the information-bearing symbols are real- valued, one can use the rate-one generalized real orthogonal design proposed by [31] in the 
cooperative networks without any changes. The codes achieve the single-symbol ML decodability and the full diversity order [27]. Therefore, 
we focus on the complex-valued symbols in this paper. 

'We set p = -y/ Er/ {1 + Es) as in many previous publications including [18], [23], [28], [29]. This ensures the average transmission power 
of every relay is Er in a long term. 

'Even when the relays can not estimate the channels, the destination is still able to obtain the full CSI. This is because the destination usually 
dose not have any power or size limitation, and hence, it can be equipped with sophisticated channel estimation devices. Furthermore, although 
the relays can not estimate the channels, they can forward the pilot signals from the source to the destination, and they can transmit their own 
pilot signals to the destination. Based on those pilot signals, the destination is able to obtain the full CSI, which has been discussed in [16] and 
[27]. 




p{ykAk + ylBk) 



phksAk + phls*Bk + pukAk + pnlBk- 



(1) 
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the destination is given by 

K 

Vd = ^''^k + 
fc=i 

K K 

= ^{pfkhksAk + pfkhls*Bk) +^{pfknkAk + pfknlBk) + rid, (2) 

fe=i fe=i 

where = [nd,i, • • • , n^^r] is the additive noise at the destination and it is uncorrelated complex Gaussian with 
zero mean and identity covariance matrix^ Define wd, Xd, and no as follows: 

WD = [pfl,---,PfK] (3) 

Xd = [hisAi + hls*Bi;---;hKsAK + h}^s*BK] (4) 

K 

no = ^{pfknkAk + pfknlBk) + na; (5) 



fe=i 

then we can rewrite (|2|l in the following way 

ya = WDXD + nD. (6) 

Furthermore, from (|5]), it is easy to see that the mean of n^) is zero and the covariance matrix R of nn is given 
by 

K 

R = Y.{\pf>'\'{^kAk + BfBk))+I. (7) 

A,-l 

Secondly, we present another system model, which is for the row-monomial DOSTBCs-CPI studied in Section 
IIVI We assume that there is no strict power or size constraint on the relays and the relays can obtain partial CSI 
of the first hop by the equipped channel estimation devices. Specifically, we assume the fc-th relay has the CPI of 
the first hop, i.e. it knows the phase 9k of the channel coefficient hku Note that this assumption does not imply 
more pilot signals compared to the assumption that relays have no CSI of the first hop. Actually, in order to make 
the destination have full CSI, the relays always need to forward the pilot signals from the source to the relays. 
Furthermore, the relays always need to transmit their own pilot signals to the destination. Therefore, the same 
amount of pilot signals is needed in all circumstances. Furthermore, the assumption that the relays have the CPI of 
the first hop does not imply any feedback overhead, because the relays do not need to have any CSI of the channels 
from themselves to the destination. 

*We assume that there is no direct hnk between the source and destination. The same assumption has been made in many previous 
publications [21], [23], [24], [29]. Furthermore, perfect synchronization among the relays is assumed as in [20], [21], [23], and [27]-[29]. 
Although synchronization is a ciitical issue for the practical implementation of cooperative networks, it is beyond the scope of this paper 

'in this paper, we assume that the relays can estimate without any errors as in [27]-[29]. It will be interesting to study the scenario when 
the relays do not have perfect estimations of S^; but it is beyond the scope of this paper. 
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Based on the CPI, the fc-th relay first obtains by = e ^ '"y^. and then builds the transmitted signal vector 
as 

a'fc = piykAk + y^*Bk) 

= p\hk\sAk + p\hk\s*Bk + pe-^'^'-UkAk + pe^'^nlBk. (8) 
Consequently, the received signal vector at the destination is given by 

yc = wcXc+nc, (9) 

where 

WC = [pfl\hil---,pfK\hK\] (10) 

Xc = [sAi + s*Bi;---;sAk + s*Bk] (11) 

K 

nc = Y.'^pfke'^'>''nkAk+pfke''''"nlBk)+nd. (12) 

fc=i 

From ( fT2b . it is easy to see that the mean of nc is zero and the covariance matrix R of nc is still given by (|7]|. 

III. Distributed Orthogonal Space-Time Block Codes 

In this section, we abandon the row-monomial limitation, which was adopted in the construction of the row- 
monomial DOSTBCs in [1]. Thus, the codes possibly generate correlated noises at the destination. However, we 
still keep the CSI limitation, i.e. the relays do not have any CSI. It is easy to see that such codes are just the 
DOSTBCs proposed in [1], whose definition is as follows. 

Definition 1: A K x T code matrix X^i is called a DOSTBC in variables si, - ■ ■ ,sn if the following two 
conditions are satisfied: 

Dl.l) The entries of Xd are 0, ±/ifeS„, ±/i^s*, or multiples of these indeterminates by j, where j = — 
D1.2) The matrix Xd satisfies the following equality 

XdR-'x'^ = IsipDi + .-. + ls^pD^, (13) 
where D„ = diag[\hi\'^Dn,i, ■ ■ ■ , and Dn,i, ■ ■ ■ ,Dn,K are non-zero. 

In [1], it has been shown the DOSTBCs are single-symbol ML decodable and achieve the full diversity order 
K. However, the bandwidth efficiency of the DOSTBCs has not been analyzed in [1]. Thus, it is still unknown 
if removing the row-monomial limitation can improve the bandwidth efficiency or not. In order to answer this 
question, we derive an upper bound of the data-rate of the DOSTBC in the following. To this end, one may think 



of redefining wd and as follows 

WD = [pfi\hi\,---,pfK\hK\] (14) 

Xd = [sAi+ s*Bi;---;sAk + s*Bk], (15) 
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where Ak — e^^^Ak and Bk — e^^^''Bk- Then Xd can be seen as the generaUzed orthogonal designs, and 
hence, the results in [32] may be directly used. Actually, this method will make the analysis more complicated. 
Note that the new associated matrices Ak and Bk have a fundamental difference with the associated matrices of 
the generalized orthogonal design in [32]. That is, Ak and Bk contain Ok, which is a random variable. Due to 
this reason, it is very hard to find the properties of Ak and Bk by using the results in [32], and hence, it is very 
complicated to derive an upper bound by using ( fT4l ) and ( fTsT l. Instead of this approach, in this paper, we define 
W]j and as in ([3]) and (|4|i, respectively, and derive an upper bound of the data-rate by analyzing the properties 
of Ak and Bk- Some fundamental properties of Ak and Bk are given in the following lemma at first. 

Lemma 1: If a DOSTBC X jj in variables si, - ■ ■ ,S]y exists, its associated matrices Ak and Bk are column- 
monomial. Furthermore, the orthogonal condition ( fT3] ) on X^ holds if and only if 

Ak,R-'A^^ = 0, h^k2 (16) 

Bk,R-^B^^ = 0, h^k2 (17) 

Ak,R-'Bg+BlR-'Al - 0, (18) 

Bk,R-'Al+AlR-'Bl = 0, (19) 

AkR-'Aj^ + B*kR-'Bl = diag[Z?i,fc,---,i?w^fc]. (20) 
Proof: By following the proof of Property 3.2 in [30], it is very easy to show that Ak and Bk are column- 
monomial. Furthermore, by following the proof of Lemma 1 in [1] and the proof of Proposition 1 in [32], it is not 
hard to show ([T6ll-(|20ll. □ 
Lemma [T] gives us some fundamental properties of Ak and Bk- But, due to the existence of R^, we can not 
obtain an upper bound by using the conditions (fT6Tl-(l20]l directly. Therefore, we simplify those conditions in the 
following theorem by eliminating R^. 

Theorem 1: If a DOSTBC Xd in variables si, - ■ ■ ,sm exists, we have 

XpXg = \s^\^E^ + --- + \sn\^E^, (21) 

where En = dia.g[\hi\'^En,i, ■ ■ ■ , \hK\'^ En,K] and -En,i, • • • , En,K are strictly positive. Equivalently, the associated 
matrices Ak and Bk satisfy the following conditions 

Afe.Af^ = 0, fci^fca (22) 

Bfc^Bf^ = 0, fci^fca (23) 

Ak.Bg+B^Al = (24) 

Bk.Ag+Al^Bl = (25) 

AkA^ + BlBl = dmg[E,,k,---.EN,k]. (26) 
Proof: See Appendix A. □ 
After comparing Theorem [T] and the definition of the generalized orthogonal design in [32], it seems that the 
DOSTBCs are in a subset of the generahzed orthogonal design. However, note that there is a fundamental difference 
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between the DOSTBCs and the generahzed orthogonal design. That is, the code matrix Xe, of a DOSTBC contains 
the channel coefficients h^- Actually, this fundamental difference explains why Ak and Bk in (fTsT i contain Ok- 
Furthermore, this fundamental difference induces the conditions (l22l i and (l23T l. Those two conditions help derive 
an upper bound of the data-rate of the DOSTBC in the following theorem. 

Theorem 2: If a DOSTBC X^) in variables si, • • • , sjv exists, its data-rate TLd satisfies the following inequahty: 

Proof: See Appendix B. □ 
Theorem|2] suggests that the DOSTBCs have approximately twice higher bandwidth efficiency than the repetition- 
based cooperative strategy. Furthermore, it is worthy of addressing that the DOSTBCs have the same decoding 
complexity and diversity order as the repetition-based cooperative strategy. On the other hand, when N and K 
are both even, the upper bound of the data-rate of the DOSTBC is exactly the same as that of the row-monomial 
DOSTBC proposed in [1]. Therefore, one can use the systematic construction method developed in Section V-A 
of [1] to build the DOSTBCs achieving the upper bound of dZTl l for this case. For the other cases when N and/or 
K are odd, the upper bound of the data-rate of the DOSTBC is larger than that of the row-monomial DOSTBC. 
Unfortunately, we have not found any DOSTBCs achieving the upper bound of (|27] | for those cases. 

On the other hand, like the row-monomial DOSTBCs, the DOSTBCs may have poor bandwidth efficiency, when 
there are many relays. This is because the upper bound ( |27] | decreases with the number K of relays. This problem 
can be solved very well when the relays can exploit the CPl to construct the codes, which will be shown in the 
next section. 

IV. Row-Monomial Distributed Orthogonal Space-Time Block Codes with Channel Phase 

Information 

In this section, we remove the CSI limitation and assume the relays use the CPI to construct the codes. But, we 
still keep the row-monomial limitation, in order to facilitate the analysis. Therefore, we define the row-monomial 
DOSTBCs-CPI in the following way. 

Definition 2: A K y. T code matrix Xc is called a row-monomial DOSTBC-CPI in variables si, • • • , sat if it 
satisfies Dl.l in Definition 1 and the following equality 

XcR-^X^ = \s^\''F^ + --- + \sn?Fn, (28) 

where Fn = diag[F„_i, • • • , Fn.n] and Fn.i, ■ ■ ■ , Fn.K are non-zero. Furthermore, it associated matrices Ak and 
Bk, 1 <k < K, sxt all row-monomial. 

It is easy to check that the row-monomial DOSTBCs-CPI are single-symbol ML decodable. By using the technique 
in [1] and [33], it can be shown that the row-monomial DOSTBCs-CPI also achieve the full diversity order. 
Furthermore, by following the proof of Lemma [U and Theorem [U it is not hard to show that a row-monomial 
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DOSTBC-CPI Xc satisfies the follow equality 

XcX^ = IsipGi + .-. + lsArpGjv, (29) 

where G„ — diag[G„,i, • • • , Gu.k] and Gn.i, ■ ■ ■ , Gn.K are strictly positive. Note that the code matrix Xc of a 
row-monomial DOSTBC-CPI does not contain any channel coefficients. By comparing (|29] | and the definition of 
the generalized orthogonal design, we notice that a row-monomial DOSTBC-CPI must be a generalized orthogonal 
design. All the analysis of the generalized orthogonal design in [32] are valid for the row-monomial DOSTBCs-CPI. 
In particular, when K = 2, the data-rate of the row-monomial DOSTBC-CPI can be as large as one by using the 
Alamouti code proposed in [34]; when K > 2, the data-rate of the row-monomial DOSTBC-CPI is upper-bounded 
4/5, which is the upper bound of the data-rate of the generalized orthogonal design [32], 

Actually, the row-monomial DOSTBCs-CPI have some unique properties which the generahzed orthogonal design 
does not have. Those unique properties help find a tighter upper bound of the data-rate of the row-monomial 
DOSTBC-CPI. To this end, we first have the following theorem. 

Theorem 3: Assume Xc is a DSTC in variables si, - ■ ■ ,sn, i-e. every row of Xc contains the information- 
bearing symbols si, • • • , sn- Moreover, assume that the noise covariance matrix R of Xc is diagonal. After proper 
column permutations, we can partition into R^ = diag[i2i, i?2, • • • , Rw] such that the main diagonal entries 
of Rw are all equal to and Ri ^ Rj for i ^ j. After the same column permutations, we can partition Xc 
into Xc = [Xci, ■ ■ ■ , Xcw]- Let Xcw denote all the non-zero rows in Xcw Assume that Xcw contains N^j 
different information-bearing symbols and they are s^, • • • , Then Xc is a row-monomial DOSTBC-CPI if 
and only if every sub-matrix Xcw is a row-monomial DOSTBC-CPI in variables s™, • • • , s]^ . 

Proof: See Appendix C. □ 

Theorem [3] means that, when a DSTC Xc generates uncorrected noises at the destination, the code is single- 
symbol ML decodable as long as it can be partitioned into several single-symbol ML decodable codesF*! Furthermore. 
Theorem [3] is crucial to derive an upper bound of the data-rate of the row-monomial DOSTBC-CPI. This is because 
it enables us to analyze the data-rate of every individual sub-matrix Xcw instead of Xc itself. When Xcw has 
one or two rows, it is easy to see that its data-rate can be as large as one. When Xcw has more than two rows, 
the following theorem shows that the data-rate of Xcw is exactly 1/2. 

Theorem 4: Assume is a row-monomial DOSTBC-CPI and its noise covariance matrix is R. By proper 
column permutations, we can partition R^ into R^ = diaug[R\,R2,- ■ ■ ,Rw] such that the main diagonal 
entries of Rw are all equal to and Ri ^ Rj for i ^ j. By the same column permutations, we can partition 
Xc into Xc = [Xci, • • • , ^cw]- Let Xcw denote all the non-zero rows in Xcw and assume the dimension of 

'"Note that s™, ■ ■ ■ are all from the set s = [si, ■ • • , sjv]. 

"For the rate-3/4 code in [29], it generates uncorrelated noises at the destination; but the main diagonal entries of R are all different. If 
we partition the rate-3/4 code by the way presented in Theorem [3] we will see that every sub-matrix Xcw is actually a column vector with 
more than one non-zero entries. Thus, Xq^ can not be a row-monomial DOSTBC-CPI, and hence, it is not single-symbol ML decodable. By 
Theorem [3] the rate-3/4 code can not be a row-monomial DOSTBC-CPI either and it is not single-symbol ML decodable. 
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Xcw is ifw) X T^. Then the data-rate of Xcw is exactly 1/2 when K^j > 2. 

Proof: See Appendix D. □ 
Based on Theorems [3] and IH we derive an upper bound of the data-rate of the row-monomial DOSTBC-CPI in 
the following theorem. 

Theorem 5: When K > 2, the data-rate TZc of the row-monomial DOSTBC-CPI satisfies the following inequality 



7^c 



N 1 
T - 2' 



(30) 

1 z 

Proof: See Appendix E. □ 
We notice that the data-rate of the row-monomial DOSTBC-CPI is independent of the number K of relays. 
Thus, the row-monomial DOSTBCs-CPI have good bandwidth efficiency even in a cooperative network with many 
relays. Furthermore, compared to the row-monomial DOSTBCs and the DOSTBCs, the row-monomial DOSTBCs- 
CPI improve bandwidth efficiency considerably, especially when the cooperative network has a large number of 
relays. The improvement is mainly because the relays exploit the CPI to construct the codes. As we have seen, 
because the code matrix Xjj of a DOSTBC contains the channel coefficient hk, the conditions (|22]) and (|23] | are 
induced. Those two conditions severely constrain the data-rate of the DOSTBC. On the other hand, by exploiting 
the CPI, the code matrix Xc of a row-monomial DOSTBC-CPI does not have any channel coefficients. Thus, the 
conditions ( |22] | and (|23] | are not induced, and the data-rate is greatly improved. Furthermore, recall that exploiting the 
CPI at the relays does not increase the pilot signals or require any feedback overhead. Compared to the DOSTBCs, 
the only extra cost of the row-monomial DOSTBCs-CPI is that the relays should be equipped with some channel 
estimation devices to estimate 6k- 

Interestingly, the row-monomial DOSTBCs-CPI achieving the upper bound 1/2 are easy to construct and they 
are given in the following theorem. 

Theorem 6: The rate-halving codes developed in [31] can be used as the row-monomial DOSTBCs-CPI achieving 
the upper bound 1 /2 of the data-rate. 

Proof: It is easy to check that the rate-halving codes satisfy Definition 2 and they always achieve the data-rate 



1/2. 

As an example, when iV = 4 and K 
given as follows: 



Xr 



□ 

4, the row-monomial DOSTBC-CPI achieving the upper bound 1/2 is 



(31) 
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V. Numerical Results 

In this section, we present some numerical results to demonstrate the performance of the DOSTBCs and the 
row-monomial DOSTBCs-CPI. In our simulation, we define the average signal to noise ratio (SNR) per bit as the 



February 1, 2008 



DRAFT 



10 



ratio of Er to the logarithm of the size of the modulation scheme. Furthermore, we adopt the power allocation 
proposed in [23], i.e. Eg = KE,.- 

In Fig. [T] we let iV = 4 and K ^ A. For this case, we see that the average bit error rate (BER) performance of 
the DOSTBCs and the row-monomial DOSTBCs-CPI is much better than that of the repetition-based cooperative 
strategy, especially when the bandwidth efficiency is 2 bps/Hz. The DOSTBCs and the row-monomial DOSTBCs- 
CPI have almost the same performance. This is because, when = 4 and K — 4, the DOSTBCs and the 
row-monomial DOSTBCs-CPI have the same data-rate 1/2. Fig. [T] also demonstrates that the performance of the 
DOSTBCs and the row-monomial DOSTBCs-CPI is slightly worse than that of the rate-3/4 code proposed in [29]. 
But, note that the rate-3/4 code is not single-symbol ML decodable, and hence, its decoding complexity is much 
higher than that of the DOSTBCs and the row-monomial DOSTBCs-CPI. In Fig. |2] we set iV = 8 and K 6. 
For this case, the average BER performance of the row-monomial DOSTBCs-CPI is now much better than that of 
the DOSTBCs. This is because, when = 8 and K = 6, the data-rate of the row-monomial DOSTBC-CPI is still 
1/2; while the data-rate of the DOSTBC becomes 1/3. 

VI. Conclusion and Future Work 

In the first part of this paper, we consider the DOSTBCs, where the noises at the destination are possibly correlated 
and the relays have no CSI of the first hop. An upper bound of the data-rate of the DOSTBC is derived. When N and 
K are both even, the upper bond of the data-rate of the DOSTBC is exactly the same as that of the row-monomial 
DOSTBC in [1]. When N and/or K are odd, the upper bound of the data-rate of the DOSTBC is larger than 
that of the row-monomial DOSTBC, which means the DOSTBCs can potentially improve the bandwidth efficiency. 
However, we notice that, like the row-monomial DOSTBCs, the DOSTBCs may not have good bandwidth efficiency 
in a cooperative network with many relays, because the upper-bound of the data-rate of the DOSTBC decreases with 
the number K of the relays. In the second part of this paper, we propose the row-monomial DOSTBCs-CPI, where 
the noises at the destination are always uncorrected and the relays exploit the CPI of the first hop to construct the 
codes. We derive an upper bound of the data-rate of those codes and find the actual codes achieving this upper 
bound. The upper bound of the data-rate of the row-monomial DOSTBC-CPI suggests that the row-monomial 
DOSTBCs-CPI have better bandwidth efficiency than the DOSTBCs and the row-monomial DOSTBCs. Moreover, 
the upper bound of the data-rate of the row-monomial DOSTBC-CPI is independent of the number K of the relays, 
and hence, the codes have good bandwidth efficiency even in a cooperative network with many relays. 

Our work can be extended in the following two ways. First, it will be very interesting to consider a more general 
case, where the noises at the destination are possibly correlated and the relays use the CPI of the first hop to 
construct the codes. Intuitively, such codes should have even higher data-rate than the row-monomial DOSTBCs- 
CPI. But, we conjecture that the improvement of the data-rate is just marginal. This is because, by comparing the 
DOSTBCs and the row-monomial DOSTBCs, we notice that removing the row-monomial limitation just slightly 
improves the data-rate. Secondly, we can assume that the relays have the full CSI, including not only the channel 
phase 9k but also the channel magnitude \hk\, of the first hop and use this information in the code construction. 
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We notice that the use of the channel magnitude \hh\ only affects the structure of the noise co variance matrix R; 
but it can not change the structure of the code matrix Xc- Therefore, we conjecture that the data-rate can not be 
improved by assuming the relays have the fuU CSI of the first hop. 

Appendix A 
Proof of Theorem [T] 

By following the proof of Lemma [T] it can be easily shown that (I2TI 1 is equivalent with the conditions (l22l l-(l26ll. 
On the other hand, if a DOSTBC Xd exists, ([T3]l holds by Definition 1, and hence, (fT6Tl-(l20]l hold by Lemma [l] 
Therefore, in order to prove Theorem [U we only need to show that, if (fT6]l-(l20b hold, (I22l i-(|26]| hold and En^k is 
strictly positive. 

We start our proof by evaluating [J2]tj^(2 ™<i [-^~^]ti.t2- According to (O, when ti ^ t2, [R]ti.t2 can be either 
null or a sum of several terms containing |/o/fep; when ti = t2 = t, [R]t.t is a sum of a constant 1, which is from 
the identity matrix, and several terms containing \pfk\'^- Therefore, we can rewrite [R]t^t as [R]t.t = Rt.t + !> where 
Rt^t accounts for all the terms containing |p/fep. [R^'^]ti,t2 is given by [i2~^]tj_f2 — C't2,ti/det(-R), where Ct2.ti 
is the matrix cofactor of [R]t2.ti- When ti —t2 — t, by the definition of matrix cofactor, Ct.t contains a constant 1 
generated by the product Y[i=i i^t{-^]i,i ~ TlILi i^ti-^i,i + Furthermore, it is easy to see that the constant 1 is 
the only constant term in Ct.t- Thus, Ct^t can be rewritten as Ct^t — Ct^t + 1 and there is no constant term in Ct^t- 
Consequently, [R^^]t.t can be rewritten as [i2^^]f_t — C't_t/det(i2) + l/det(i?). When ti ^ t2, Ct2.ti does not 
contain any constant term, and hence, [R^^]t^,t2 does not contain the term l/det(ii)0 Therefore, we can extract 
the term l/dct(i?) from every main diagonal entry of R^ and rewrite R^ in the following way 

det(J?) dct(il) 
Then we show that ( l22T l holds if ( fTSI l holds. If (fTSI l holds, we have 

- ^)^^^^< + d^^^^< - °- (^-^^ 
Note that R^ and C are random matrices. In order to make ( |A.2| l hold for every possible R^ and C, both terms 
in ( IA.2I 1 must be equal to zero. Therefore, ( |22] | holds. Similarly, we can show that (|23]l-(|25ll hold if (fT7li-(fT9b hold. 
Now, we show that ( l26b holds if (l20l l holds. If (l20b holds, we have 



R^ = -T-T7^C + ——I. (A.l) 



AkR^^A- + BlR-^Bl = ^AA.CA^ + BlCBl)+' (AkA- 



det(H) V " ft ' ft ' det(il) 

For the same reason as in ( IA.2b . the off-diagonal entries of A^A^ + B\BJ. must be zero, and hence, ( l26b holds. 

'^Ct2,ti may be zero; but it does not change the conclusion that {R~^]ti,t2 does not contain the term l/dct{i?). 



February 1, 2008 



DRAFT 



12 



Lastly, we show that En^k is strictly positive if ( l20b holds. From ( l20b and ( l26b . we have 



T T 



t=l i=l 
T 

S„,fc = ^(|[Afe]„,4p + |[B,]„^,|2). (A.5) 



t=i 



Since Z3„^fe is non-zero, at least one [Afc]„.t or one [Bk]n.t is non-zero. Therefore, En^k = X]t=i(l[^fe]n,tP 
[[B/cJn^tP) is strictly positive, which completes the proof of Theorem [T] 

Appendix B 
Proof of Theorem |2] 

Let ^ = [Ai,---, AkV and B = [-Bi, • • • , BkV; then the dimension of A and B is NK x T. From (l22l l. 
every row of A^,^ is orthogonal with every row of when fci 7^ ^2 1^ Furthermore, because Ak is column- 
monomial by Lemma [T] every row of A^ is orthogonal with every other row of Ak- Therefore, any two different 
rows in A are orthogonal with each other, and hence, raink^A) = SaLi i'ank(Afe). Similarly, any two different 
rows in B are orthogonal with each other, and hence, rank(B) — X^fcLi i'ank(Bfc). 

On the other hand, from ( |26] |, we have 



rank(Afc) + rank(Bfe) > rank(diag[£;i,fc, • • • , EN,k]) = N, (B.l) 
where the inequality is from the rank inequality 3) in [32], and hence, 

K K 

^ rank(Afe) + ^ rank(Bfc) > NK. (B.2) 

fc=i fe=i 

Because rank(74.) and rank(B) are integers, we have 

K 

rank(A) = rank(Afc) > 



k=l 

or 



2 



(B.3) 



rank(B) ^rank(Bfc) > 



(B.4) 



fe=i 

If ( IB.3I ) is true, T > rank(A) > \{NK)/2\ and (|22l) holds. If ( IB.4b is ti-ue, the same conclusion can be made. 

Appendix C 
Proof of Theorem [3] 

The sufficient part is easy to verify. Thus, we focus on the necessary part, i.e. if Xq is a row-monomial 
DOSTBC-CPI, all the sub-matrices Xcw are also row-monomial DOSTBCs-CPI. Assume that the dimension of 
R 

"a row vector x is said to be orthogonal with another row vector y if xy^ is equal to zero. 
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Firstly, we show that XcwRwXfj^ is a diagonal matrix. Based on (|28] |. when ki k2, [XcR ^X^]ki^k2 is 
given by 

W Tu, 

[XcR ^Xc]ki^k2 = ^ ^^[Xcw]kiA^Cw]*k2,fRw = 0. (C.l) 

w=l t=l 

If all the terms in this summation are zero, it is trivial to show that Y^f^iiX cw]ki ,t[-^ Ciu]k2 t^i" ~ ^ f^'" 1 ^ ^ ^ 
W. Because Xcw contains all the non-zero entiies of Xcw, we have [XcwRwXc^]kj^^k2 — [XcwRwXQ^]kj^^k2 = 
HtZA^ Cw\ki,t[^ CwYk^^t^w = 0, which means XcwR-wX^w is a diagonal matrix. 

If there is one term [Xcwi]ki,ti [^Cwi]%^ ti-^wi 7^ 0, some other terms must cancel this term in order to make 
(IC.lb hold. Actually, the non-zero term [Xcwi\ki.ti [-^Ctojfcj ti-^wi must be cancelled by exactly one other term. 
This can be shown by contradiction. We assume that [Xcwi\ki.ti[Xcwi]k2 ti-^wi is cancelled by two other terms 
together, i.e. 

[Xcwt]kt,tt[Xcwt]*k2,ti-^^i + [XciU2]kt,t2[Xcw2]*k2,t2-^^2 + [-'^Cuia ] fei ,t3 [-'^Cuia ] fca ,t3 ^""s = 0- (C-2) 

In order to make this equality hold, one of the following three equalities must hold: 1) [Xcw2]ki.t2 = i[^CtDi]fei,ti; 
2) [Xctualfci.ta = ±[Xcwi]kuti; 3) ±[Xcw2\ki.t2 = ± [^Cu-alfei ,(3 = [-^Ctojfc^ ,ti ■ However, those three equali- 
ties all contradict with our assumption that the covariance matrix R is diagonal. For example, we assume \Xcwi]ki ,ti = 
Sn^, I <n< N^^, and the equality [Xcw2\kt,t2 = ±[^c«Ji]fci,ti holds. Thus, [Xcu,Jfci,t2 = isj?' and s^^ is 
transmitted in the fci-th row of Xc for at least twice. This makes the noise covariance matrix R non-diagonal, 
which contradicts with our assumption. If we assume [Xcwi]ki.ti[Xcwi]*k,^ ti-^tui is cancelled by more than two 
other terms, the same contradiction can be seen similarly. Thus, [Xcwi]ki,ti [Xcwi]%^ ti-^wi is cancelled by exactly 
one other term in the summation (IC.lb and we have 

[X Cwi]kiAi[X. Cwi]*k2,tiRwi + [-^CtU2]fcl,t2 [^CM)2]fc2,t2-^™2 — 0" 

Furthermore, because Ri 7^ Rj when i 7^ j, (IC.3b also implies that R^^ = Rw2 and wi = W2- This means 
that, if one term in the summation jC.lb is non-zero, it must be cancelled by exactly one other term, which 
is from the same sub-matrix Xqw Therefore, we have Y^'^^ii-^ Cw]ki,t[X Cw]1^ t^w = when fci 7^ k2- 
Because Xcw contains all the non-zero entries of Xcw, we have [X cwRw^^J\ki,k2 — CwRtuX^JlkiM — 
Y.tZi[^Cw\kiA^Cw\*k^_^tI^w = 0, when fci 7^ ^2. Therefore, XcwRwXq^ is a diagonal matrix. 

Secondly, we show that the information-bearing symbols s™, • • • , s}5J^^ are contained in every row of Xcw 
Because every main diagonal entry of R^ is the same, it follows from (Q that every column in Xcw has non-zero 
entries at the same rows. Therefore, the non-zero rows in Xcw does not contain any zero entries. Since Xcw 
contains all the non-zero rows in Xcw, every entry in Xcw is non-zero. Then we assume that [Xcw\ki,ti — s™, 
1 < n < N.uj. Because every entry in Xcw is non-zero, we can find another non-zero entry [Xcw\k2M^ ^1 7^ ^2, 
from the ti-th column of Xciu- Thus, [XcwR-wX^AkiM must contain the term [Xcw\kuti[^cw]*k^^ti^w 
Because [XcwRw^cw]ki,k2 = 0' [^Ctu]fei.ti [^Ctulfe^ ti-^w must be cancelled by another term and we assume it 
is [Xcui]fci,t2 [-^Ciii]fe2,t2^«" ^1 ^ In order to make [Xcu,]fci,ti [Xciii]fc2,ti-Rtu + [-^Ciu]fci,t2 [^Ctu]fe2,t2-^t" = 0, 
we must have [Xcw\ki,t2 = ±[Xcw\kiM or [Xcw\*k^^t2 = '^[^cw\ktM- Due to the row-monomial condition. 
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[^Cw]kut2 can not be ±[Xctu]ki,ti, and hence, we have [Xcw]k2,t2 = M^Cw]*k^,t^ = ±sJi^*- This means that 
the /c2-th row contains the information-bearing symbol sj^ as well. Taking a similar approach, we can show that 
the information-bearing symbols s^", • • • , sj^^ are contained in every row of Xcw 

Because XcwRwXfjjj^ is a diagonal matrix and every row of Xcw contains all the information-bearing symbols 
s™, ■ ■ • I s^^, XcwRw^Q^ can be written as 

Xc^oR^.Xc^ = kri'Mi + • • • + Is'^J^Mn^, (C.4) 

where Mn are diagonal and all the main diagonal entries are non-zero. Note that, if the relays only transmit Xcw 
to the destination, is actually the inverse of the noise co variance matrix at the destination. This is because Xcw 
and -R^ are obtained after the same column permutations. Therefore, iCA\ is equivalent with ( |28] ). Furthermore, 
since Xcw is a sub-matrix of Xc, it automatically satisfies Dl.l and the row-monomial condition. Thus, we 
conclude that Xqw satisfies Definition 2 and it is a row-monomial DOSTBC-CPI. 

Appendix D 
Proof of Theorem |4] 

From Theorem [3j every sub-matrix Xciu is a row-monomial DOSTBC-CPI in variables s^", • • • , sj^^- Further- 
more, by ( |29] l, every sub-matrix Xcw is also a generalized orthogonal design. For convenience, we refer to any 
entry containing s^^ as the -entry. Similarly, any entry containing sj^* is referred to as the s™* -entry. 

By the row-monomial condition, any row in Xcw can not contain more than one sj^^ -entry or sj^* -entry. 
Therefore, the data-rate of Xcw is lower-bounded by 1/2, which is achieved when every row contains exactly one 
s™^ -entry and one s™* -entry for 1 < < iV^. 

Then we show that the data-rate can not be strictly larger than 1/2 by contradiction. Without loss of generality, 
we assume the first row of Xqw is [s^", • ■ ■ , s^^^,s^*, • • • , s™* ], where N'^ < N^- Hence, the data-rate of Xcw 
is Nw/{Nw + N^) and it is strictly larger than 1/2. Furthermore, because every entry in Xcw is non-zero, this 
assumption also means that every row in Xcw contains exactly N^+N^ non-zero entries. Because s^l +i' ' ' ' ' '^w* 
are not transmitted by the first row, the second row can not have any sj^^ -entries, + 1 < < This can be 
shown by contradiction. For example, if the second row has s™ , on the first column, the inner product of the first 
and second rows must have the term sfs^*, Because Xcw is a generalized orthogonal design, the inner product 
of any two rows must be zero. In order to cancel the term sfs"^*, the first row must have an s^* ^^-entry, which 
contradicts our assumption. Thus, the second row can not contain any sj^^ -entries, iV,„ + 1 < n^, < iV^,. On the 
other hand, because the second row must contain exactly iV^, + A^^ non-zero entries, it must have the s™^ -entries 
for 1 < n.u, < N'^ and the sj^'* -entries for 1 < < Nw 

Since > 2, we can do further investigation on the third row of Xcw The third row is decided by the 
first and the second row jointly. Because the first row does not have s™T s^* , the third rows can not 

have any sj^'^ -entries, N.^^ + 1 < n.w < N.^]. Furthermore, because the second row does not have any sj^^ -entries, 
+ 1 < < Nw, it can be easily shown that the third row can not have any sj^* -entries, + ^ < n-w < -/V„. 
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Hence, the third row can only have the s™^ -entries and the s™* -entries for 1 < n,„ < N^. There are at most 2N^ 
non-zero entries in the third row and it contradicts with the fact that every row in Xcw contains exactly N^j + 
non-zero entries. This means that the data-rate of Xcw can not be strictly larger than 1/2. Because it has been 
shown that the data-rate of Xcw is lower-bounded by 1/2, we conclude that the data-rate of Xcw is exactly 1/2 
when Kyj > 2. 

Appendix E 
Proof of Theorem |5] 

Like in Theorems [3] and S] we still partition Xq into Xc = [Xci,- ■ - jXcw]- Let X^ denote the matrix 
containing all the sub-matrices Xcw with k non-zero rows, and hence, Xc = [Xc, ■ ■ ■ , X^]. Furthermore, 
assume the total number of non-zero entries in X^ is Pk, and hence, X^fcLi is the total number of non-zero 
entries in Xc- For convenience, we refer to any entry containing s„ as the s„-entry. Similarly, any entry containing 
s* is referred to as the s*-entry. 

In order to derive the upper bound of the data-rate, we first consider the case that K = 3. For this case, Xq 
contains at most one sub-matrix and we assume X^ = Xci- Thus, X^ is a row-monomial DOSTBC-CPI and its 
data-rate is exactly 1/2 by Theorem]?] Furthermore, we assume X^ is in variables si, ■ ■ ■ , sni, 1 < -^^i < N. By 
the proof of Theorem]!] every row of Xq contains exactly one s„-entry and one s*-entry, 1 < n < A^i. Therefore, 
there is no s„-entry or s* -entry in X]j and X^, 1 < n < iVi; otherwise, there will be two s„-entries or two s*- 
entries in a row of Xc, which will make the noise covariance matrix R non-diagonal. Thus, the matrix [Xq, Xq] 
is actually a row-monomial DOSTBC-CPI in variables s„+i, • • • ,sn. Furthermore, because every column in the 
matrix [Xq,Xq] has at most two non-zero entries, it is easy to show that its data-rate can not be larger than 
1/2 by following the proof of Theorem 2 in [1]. Because the data-rate of X^ is exactly 1/2 and the data-rate of 
is less than 1/2, the data-rate of Xc — [Xc,Xq,Xq] must be upper-bounded by 1/2 when K ^ 3. 

Secondly, we consider the case that K > 3. When fc > 2, the data-rate of X^ is exactly 1/2. This means, if an 
information-bearing symbol s„ appears in a row of X^, it appears exactly twice. On the other hand, ( l29l l implies 
that every row of Xc must have the information-bearing symbol s„ for at least once, 1 < n < A^. Therefore, the 
following inequahty holds 

2 K 

k=l k=3 

On the other hand, there are totally Pk/k columns in Xq. Thus, the total number T of columns in Xc = 
[Xc, ■ ■ ■ ,Xc] is given by 

^ A 

k=l 

By ( IE. II ) and ( IE.2I ). it is easy to obtain 2N < T under the assumption that K > 3, and hence, the data-rate of Xc 
is upper-bounded by 1/2 when K > 3. 
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Fig. 1. Comparison of the rate-3/4 code from [29], the DOSTBCs, the row-monomial DOSTBCs-CPI, and the repetition-based cooperative 
strategy, N = A, K = A. 
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Fig. 2. Compaiison of the DOSTBCs and the row-monomial DOSTBCs-CPI, N = 8, K = 6. 
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